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Abstract 


Population protocols are networks of hnite-state agents, interacting randomly, and updating their 
states using simple rules. Despite their extreme simplicity, these systems have been shown to coopera¬ 
tively perform complex computational tasks, such as simulating register machines to compute standard 
arithmetic functions. The election of a unique leader agent is a key requirement in such computational 
constructions. Yet, the fastest currently known population protocol for electing a leader only has linear 
convergence time, and, it has recently been shown that no population protocol using a constant number 
of states per node may overcome this linear bound. 

In this paper, we give the hrst population protocol for leader election with polylogarithmic conver¬ 
gence time, using polylogarithmic memory states per node. The protocol structure is quite simple: each 
node has an associated value, and is either a leader (still in contention) or a minion (following some 
leader). A leader keeps incrementing its value and “defeats” other leaders in one-to-one interactions, 
and will drop from contention and become a minion if it meets a leader with higher value. Importantly, 
a leader also drops out if it meets a minion with higher absolute value. While these rules are quite sim¬ 
ple, the proof that this algorithm achieves polylogarithmic convergence time is non-trivial. In particular, 
the argument combines careful use of concentration inequalities with anti-concentration bounds, show¬ 
ing that the leaders’ values become spread apart as the execution progresses, which in turn implies that 
straggling leaders get quickly eliminated. We complement our analysis with empirical results, showing 
that our protocol converges extremely fast, even for large network sizes. 

1 Introduction 

Recently, there has been significant interest in modeling and analyzing interactions arising in biological 
or bio-chemical systems through an algorithmic lens. Several interesting computational models have been 
proposed for such networks, for example, the cellular automata model [Neu66], the stone-age distributed 
computing model [EW13], or the population model [AAD+06]. 

In particular, population protocols [AAD''‘06], which are the focus of this paper, consist of a set of n 
finite-state nodes interacting in pairs, where each interaction may update the states of both participants. The 
goal is to have all nodes converge on an output value, which represents the result of the computation, usually 
a predicate on the initial state of the nodes. The set of interactions occurring at each step is assumed to be 
decided by an adversarial scheduler, which is usually subject to some fairness conditions. The standard 
scheduler when computing convergence bounds is the probabilistic (uniform random) scheduler [AAE08, 
PVV09, DV1 2, MNRS1 4] , which picks the next pair to interact uniformly at random in each step. We adopt 
this probabilistic scheduler model in this paper. (Some references refer to this model as the probabilistic 
population model.) The fundamental measure of convergence is parallel time, defined as fhe number of 
scheduler steps until convergence, divided by re. ' 

‘Work performed in part while an intern with Microsoft Research. 

* An alternative definition is when reactions occur in parallel according to a Poisson process [PVV09,DV12]. 


1 



The class of predicates computable by population protocols is now well-understood [AAD+06, AAE06, 
AAER07] to consist precisely of semilinear predicates, i.e. predicates definable in first-order Presburger 
arithmetic. The first such construction was given in [AAD+06], and later improved in terms of convergence 
time in [AAE06]. A parallel line of research studied the computability of deterministic functions in chem¬ 
ical reaction networks, which are also instances of population protocols [CDS 14]. All three constructions 
fundamentally rely on the election of a single initial leader node, which co-ordinates phases of computation. 

Reference [AAD"''06] gives a simple protocol for electing a leader from a uniform population, based 
on the natural idea of having leaders eliminate each other directly through symmetry breaking. Unfortu¬ 
nately, this strategy takes at least linear parallel time in the number of nodes n: for instance, once this 
algorithm reaches two surviving leaders, it will require U(n^) additional interactions for these two leaders 
to meet. Reference [AAE06] proposes a significantly more complex protocol, conjectured to be sub-linear, 
and whose convergence is only studied experimentally. These references posit the existence of a sublinear¬ 
time population protocol for leader election as a “pressing” open problem in the area. In fact, the existence 
of a poly-logarithmic leader election protocol would imply that any semilinear predicate is computable in 
poly-logarithmic time by a uniform population [ AAE06] . 

Recently, Doty and Soloveichik [DS15] showed that D(n^) expected interactions are necessary for elect¬ 
ing a leader in the classic probabilistic protocol model in which each node only has constant number of 
memory states (with respect to n). This negative result implies that computing semilinear predicates in 
leader-based frameworks is subject to the same lower bound. In turn, this motivates the question of whether 
faster computation is possible if the amount of memory per node is allowed to be a function of n. 

Contribution: In this paper, we solve this problem by proposing a new population protocol for leader 
election, which converges in O(log^n) expected parallel time, using O(log^n) memory states per node. 
Our protocol, called LM for Leader-Minion, roughly works as follows. Throughout the execution, each 
node is either a leader, meaning that it can still win, or a minion, following some leader. Each node state is 
associated to some absolute value, which is a positive integer, and with a sign, positive if the node is still in 
contention, and negative if the node has become a minion. 

If two leaders meet, the one with the larger absolute value survives, and increments its value, while the 
other drops out, becoming a minion, and adopting the other node’s value, but with a negative sign. (If both 
leaders have the same value, they both increment it and continue.) If a leader meets a minion with smaller 
absolute value than its own, it increments its value, while the minion simply adopts the leader’s value, but 
keeps the negative sign. Conversely, if a leader meets a minion with larger absolute value than its own, then 
the leader drops out of contention, adopting the minion’s value, with negative sign. Einally, if two minions 
meet, they update their values to the maximum absolute value between them, but with a negative sign. 

These rules ensure that, eventually, a single leader survives. While the protocol is relatively simple, 
the proof of poly-logarithmic time convergence is non-trivial. In particular, the efficiency of the algorithm 
hinges on the minion mechanism, which ensures that a leader with high absolute value can eliminate other 
contenders in the system, without having to directly interact with them. 

Roughly, the argument is based on two technical insights. Eirst, consider two leaders at a given time T, 
whose (positive) values are at least 0(logn) apart. Then, we show that, within O(logn) parallel time from 
T, the node holding the smaller value has become a minion, with constant probability. Intuitively, this holds 
since 1) this node will probably meet either the other leader or one of its minions within this time interval, 
and 2) it cannot increase its count fast enough to avoid defeat. Eor the second part of the argument, we show 
via anti-concentration that, after parallel time ©(log^ n) in the execution, the values corresponding to an 
arbitrary pair of nodes will be separated by at least D(log n). 

We ensure that the values of nodes cannot grow beyond a certain threshold, and set the threshold in 
such a way that the total number of states is 0(log^ n). We show that with high probability the leader will 
be elected before the values of the nodes reach the threshold. In the other case, remaining leaders with 
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threshold values engage in a backup dynamics where minions are irrelevant and leaders defeat each other 
when they meet based on random binary indicators which are set using the randomness of the scheduler. 
This process is slower but correct and only happens with very low probability, allowing to conclude that the 
LM algorithm converges to a single leader within 0(log^ n) parallel time, both with high probability and in 
expectation, using 0(log^ re) states. 

In population protocols, in every interaction, one node is said to be the initiator, the other is the respon¬ 
der, and the state update rules can use this distinction. In our protocol, this would allow a leader (the initiator 
in the interaction) to defeat another leader with the same value (the responder), and could also simplify the 
backup dynamics of our algorithm. However, our LM algorithm has the nice property that the state update 
rules can be made completely symmetric with regards to the initiator and responder roles. ^ 

Summing up, we give the first poly-logarithmic time protocol for electing a leader from a uniform 
population. We note that n(relogre) interactions seem intuitively necessary for leader election, as this 
number is required to allow each node to interact at least once. However, this idea fails to cover all possible 
reaction strategies if nodes are allowed to have arbitrarily many states. 

We complement our analysis with empirical data, suggesting that the convergence time of our protocol 
is close to logarithmic, and that in fact the asymptotic constants are small, both in the convergence bound, 
and in the upper bound on the number of states the protocol employs. 

Related Work: We restrict our attention to work in the population model. The framework of population pro¬ 
tocols was formally introduced in reference [AAD+Oh], to model interactions arising in biological, chemi¬ 
cal, or sensor networks. It sparked research into its computational power [AAD’'‘06, AAE06, AAER07], and 
into the time complexity of fundamental tasks such as majority [AAE08,PVV09,DV12,MNRS14,AGV15], 
and leader election [AAD+06, AAE06].^ References interested in computability consider an adversarial 
scheduler which is restricted to he fair, e.g., where each agent interacts with every other agent infinitely 
many times. Eor complexity bounds, the standard scheduler is uniform, scheduling each pair uniformly at 
random at each step, e.g., [AAE08,PVV09,DV12,MNRS14]. This model is also known as the probabilistic 
population model. 

To the best of our knowledge, no population protocol for electing a leader with sub-linear convergence 
time was known before our work. References [AAD+06, AAE06,CDS 14] present leader-based frameworks 
for population computations, assuming the existence of such a node. The existence of such a sub-linear 
protocol is stated as an open problem in [AAD+06, AAE06]. Reference [DH13] proposes a leader-less 
framework for population computation. 

Recent work by Doty and Soloveichik [DS15] showed an D(re^) lower bound on the number of inter¬ 
actions necessary for electing a leader in the classic probabilistic protocol model in which each node only 
has constant number of memory states with respect to the number of nodes re [AAER07]. The proof of this 
result is quite complex, and makes use of the limitation that the number of states remains constant even as 
the number of nodes re is taken to tend to infinity. 

Thus, our algorithm can be interpreted as a complexity separation between population protocols which 
may only use constant memory per node, and protocols where the number of states is allowed to be a 
function of re. 

A parallel line of research studied self-stabilizing population protocols, e.g., [AAEJ06,EJ06,SNY''' 10], 
that is, protocols which can converge to a correct solution from an arbitrary initial state. It is known that 
stable leader election is impossible in such systems [AAEJ06]; references [FJ06, SNY''"10] circumvent this 


^For this reason, LM algorithm works for n > 2 nodes, because to elect a leader among two nodes it is necessary to rely on the 
initiator-responder role distinction. 

^The best known upper bound for deterministic majority is of 0(log n log s -\- log n/{es)) parallel time [AGV 1 5], where n is 
the number of nodes, s is the number of states per node, and e is the initial node difference between the two input states. The two 
problems are complementary, and no complexity-preserving transformations exist, to our knowledge. 
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impossibility by relaxing the problem semantics. Our algorithm is not affected by this result since it is not 
self-stabilizing. 

2 Preliminaries 

Population Protocols: We assume a population consisting of n agents, or nodes, each executing as a deter¬ 
ministic state machine with states from a finite set Q, with a finite set of input symbols X O Q, a finite set 
of output symbols Y, a transition function 6 : Q x Q ^ Q x Q, and an output function j : Q ^ Y. Initially, 
each agent starts with an input from the set X, and proceeds to update its state following interactions with 
other agents, according to the transition function 6. For simplicity of exposition, we assume that agents have 
identifiers from fhe sef V = {1, 2,..., n}, alfhough fhese identifiers are nof known fo agenfs, and nof used 
by fhe protocol. 

The agenfs’ inferacfions proceed according fo a direcfed interaction graph G wifhouf self-loops, whose 
edges indicafe possible agenf inferacfions. Usually, fhe graph G is considered fo be fhe complefe graph on 
n vertices, a convenfion we also adopf in fhis paper. 

The execufion proceeds in steps, or rounds, where in each sfep a new edge (u, m) is chosen uniformly af 
random from fhe sef of edges of G. Each of fhe fwo chosen agenfs updates ifs sfafe according fo function 5. 

Parallel Time: The above sefup considers sequential inferacfions; however, in general, inferacfions befween 
pairs of disfincf agenfs are independenf, and are usually considered as occurring in parallel. In particular, if 
is customary to define one unif of parallel time as n consecutive steps of fhe protocol. 

The Leader Election Problem: In fhe leader election problem, all agenfs sfarf in fhe same inifial sfafe A, 
i.e. fhe only sfafe in fhe inpuf sef X = {A}. The oufpuf sef is U = { Win, Lose}. 

A populafion profocol solves leader election wifhin i. steps wifh probabilify 1 — (f), if holds wifh 
probability l — (f) fhaf for any configuration c : V —)■ Q reachable by fhe protocol afler > £ steps, fhere exisfs 
a unique agenf i such fhaf, (1) for fhe agenf i, 'y(c(i)) = Win, and, (2) for any agenf j / i, '^{c{j)) = Lose. 

3 The Leader Election Algorithm 

In fhis secfion, we describe fhe LM leader elecfion algorifhm. The algorifhm has an infeger parameter 
m > 0, which we sef to 0(log^ n). Each slate corresponds to an integer value from fhe sef {—m, —m + 
1,..., —2, —1,1, 2, m — 1, m, m -|- 1}. Respectively, fhere are 2m + 1 differenl slates. We will refer to 
sfales and values interchangeably. All nodes sfarf in fhe same sfafe corresponding fo value 1. 

The algorifhm, specified in Eigure 1, consisls of a sef of simple deferminislic update rules for fhe node 
sfafe. In fhe pseudocode, fhe node sfales before an inleraclion are denoled by x and y, while Iheir new sfales 
are given by x' and y'. All nodes sfarf wifh value 1 and conlinue to inleracl according fo fhese simple rules. 
We prove fhaf all nodes excepf one will converge fo negalive values, and fhaf convergence is fasl wifh high 
probability. This solves fhe leader elecfion problem since we can define 7 as mapping only positive sfales fo 
Win (a leader).^ 

Since positive sfales Iranslale to being a leader according to 7 , we call a node a contender if if has a 
posifive value, and a minion olherwise. We presenl fhe algorifhm in defail below. The slate updates (i.e. fhe 
Iransifion function 6) of fhe LM algorifhm are completely symmelric, fhaf is, fhe new sfafe x' depends on x 
and y (lines 2-4) exaclly as y' depends on y and x (lines 5-7). 

If a node is a conlender and has absolule value nof less lhan fhe absolule value of fhe inleraclion partner, 
Ihen fhe node remains a contender and updales ifs value using fhe contend-priority function (lines 3 and 6 ). 
The new value will be one larger lhan fhe previous value excepf when fhe previous value was m -|- 1, in 
which case fhe new value will be m. 

'’Alternatively, 7 that maps only two states with values m and m -|-1 to WIN would also work, but we will work with positive 
“leader” states for the simplicity of presentation. 
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Parameters; 


m, an integer > 0, set to 0(log^ n) 

State Space; 

LeaderStates = {1, 2,. .., m — 1, m, m + 1}, 
MinionStates = {—1, —2, ..., —m + 1, —m}. 
Input; States of two nodes, x and y 
Output; Updated states x' and y' 

Auxiliary Procedures; 


true 

false 


is-contender (x) = 
contend-priority{x,y) = 
minion-priority {x,y) = 


if a; S LeaderStates', 
otherwise. 

m ifmax(|x 

max(|a:|, |y|) + 1 
—m 

-max(|a;|, |j/|) 


otherwise. 
ifmax(|x|,|j/|) = 
otherwise. 


= m + 1; 
m + 1; 


1 procedure update(a;, y) 

2 if is-contender{x) and |a;| > | 2 /| then 

3 a;' contend-priority{x,y) 

4 else a:' ■<— minion-priority {x,y) 

5 if is-contender{y) and \y\ > |x| then 

6 y' contend-priority {x,y) 

1 else y' ^ minion-priority {x,y) 

Figure 1; The state update rules for the LM algorithm. 


If a node had a smaller absolute value than its interaction partner, or was a minion already, then the 
node will be a minion after the interaction. It will set its value using the minion-priority function, to either 
— max(|x|, |r/|), or —m if the maximum was m + 1 (lines 4 and 7). 

Values m + 1 and m are treated exactly the same way by minions (essentially corresponding to —m). 
These values serve as a binary tie-breaker among the contenders that ever reach the value m, as will become 
clear from the analysis. 

4 Analysis 

In this section, we provide a complete analysis of our leader election algorithm. 

Notation; Throughout this proof, we denote the set of n nodes executing the protocol by V. We measure 
execution time in discrete steps (rounds), where each step corresponds to an interaction. The configuration 
at a given time f is a function c ; U —)• Q, where c{v) is the state of the node v at time t. (We omit the 
explicit time t when clear from the context.) We call a node contender when the value associated with its 
state is positive, and a minion when the value is negative. As previously discussed, we also assume that 
n > 2. Also, for presentation purposes, consider n to be a power of two. 

We first prove that the algorithm never eliminates all contenders and that a configuration with a single 
contender means that a leader is elected. 

Lemma 4.1. There is always at least one contender in the system. After an execution reaches a configuration 
with only a single node v being a contender, then from this point, v will have c(v) > 0 (mapped to WIN by 
7 ) in every reachable future configuration c, and there may never be another contender. 

Proof. By the structure of the algorithm, a node starts as a contender and may become a minion during 
an execution, but a minion may never become a contender again. Moreover, an absolute value associated 
with the state of a minion node can only increase to an absolute value of an interaction partner. Finally, an 
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absolute value may never deerease exeept from m + 1 to m. 

Let us assume for eontradietion that an exeeution reaehes a eonfiguration where all nodes are minions. 
Consider this time point T and let the maximum absolute value of the nodes be u. Beeause the minions 
eannot inerease the maximum absolute value in the system, there must have been a eontender with value u 
during the exeeution before time T. In order for this eontender to have beeome a minion by time T, it must 
have interaeted with another node with an absolute value strietly larger than u. However, an absolute value 
of a node never deereases exeept from m + 1 to m, and despite existenee of a larger absolute value than 
u before time T, u was the largest absolute value at time T, thus u must be equal to m. But after sueh an 
interaetion the seeond node that was in the state m + 1 remains a eontender with value m. Now it must have 
interaeted with yet another node of value m + 1 before time T in order to itself beeome a minion, but then 
this another node is left as a eontender with value m and the same reasoning applies to it. Our proof follows 
by infinite deseent. 

Consequently, whenever there is a single eontender in the system, it must have the largest absolute 
value. Otherwise, it eould interaet with a node with a larger absolute value and beeome a minion itself, 
eontradieting the proof above that all nodes may never be minions. Beeause of this invariant, the only 
eontender may never beeome a minion and the minions ean never beeome eontenders again. □ 


Now we turn our attention to the eonvergenee speed (assuming n > 2) of the LM algorithm. Our goal 
is bound the number of rounds neeessary to eliminate all exeept a single eontender. In order for a eontender 
to get eliminated, it must eome aeross a larger value of another eontender, the value possibly eondueted 
through a ehain of multiple minions via multiple interaetions. 

We first show by a rumor spreading argument that if the differenee between the values of two eontenders 
is large enough, then the eontender with the smaller value will beeome a minion within the next 0{n log n) 
rounds, with eonstant probability. Then we use anti-eoneentration bounds to establish that for any two 
fixed eonfenders, given fhaf no absolufe value in fhe sysfem reaehes m, after every 0{n log^ n) rounds fhe 
differenee befween fheir values is large enough wifh eonsfanf probabilify. 

Lemma 4.2. Consider two contender nodes with values ui and U2, where ui — U2 > 4,^ log n at time T for 
^ > 8. Then, after ^nlogn rounds from T, the node that initially held the value U 2 will be a minion with 
probability at least 1/24, independent of the history of previous interactions. 


Proof Call a node fhaf has an absolufe value of af teas! ui an up-to-date node, and out-of-date ofherwise. 
Inifially, af teas! one node is up-fo-dafe. Before any round, if we have x up-fo-dafe nodes, fhe probabilify 
fhaf an ouf-of-dafe node inferaefs wifh an up-fo-dafe node in fhis round inereasing fhe number of up-fo-dafe 


nodes fo x -|- 1, is 


2x{n—x) 

n{n—l) 


By a Coupon Colleefor argumenf, fhe expeefed number of rounds unfil every 


node is up-fo-dafe is fhen ^ ^ (s + yh) ^ 2nlogn. 

Now by Markov’s inequalify, fhe probabilify fhaf nof all nodes are up-fo-dafe affer log n commu- 
nieafion rounds is af mosf 2/^. Henee, expeefed number of nodes Y fhaf are up-fo-dafe affer ^nlogn 

Lef q be fhe probabilify fhaf fhe number of nodes affer log n 


eommunieafion rounds is af teas! 




whieh 


communication rounds is at least f + 1 - Then we have qn {1 — q){^ 1) >E[y] > —^ 

implies that q >l for n > 2 and ^ > 8. 

Hence, with probability at least 1/4, at least n/3 -|- 1 are nodes are up to date after log n rounds. By 
symmetry, the n/3 up-to-date nodes except the original node are uniformly random among the other n — 1 
nodes. Therefore, any given node, in particular the node that had value U 2 at time T has probability at least 
1/4 - 1/3 = 1/12 to be up-to-date after ^nlogn rounds from T. Notice that when a node that was initially 
holding value U 2 becomes up-to-date and gets an absolute value of at least ui from an interaction, it must 
become a minion by the structure of the algorithm if its value before this interaction was still strictly smaller 
than ui. Thus, we only need to show that the probability of selecting the node that initially had value U 2 at 
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least 4^ log n times (so that its value can reach ui) during these log n rounds is at most 1/24. The claim 
then follows by Union Bound. 

In each round, the probability to select this node that initially held tt 2 is 2/n. Let us describe the number 
of times it is selected in ^nlogn rounds by considering a random variable Z ~ Bin(^nlogn, 2/n). By 
Chernoff Bound, the probability being selected at least 4^ log n times in these rounds is at most: 

Ft[Z > 4^1ogn] =Fr[Z > (2^1ogn) (1 + 1)] < exp logn^ < ^ 

finishing the proof. □ 

The next Lemma shows that, after 0(nlog^ n) rounds, the difference between the values of any two given 
contenders is high, with reasonable probability. 

Lemma 4.3. Fix an arbitrary time T, and a constant ^ > 1. Consider any two contender nodes at time T, 
and time Ti which is 32^^n log^ n rounds after T. 

If no absolute value of any node reaches m at any time until Ti, then, with probability at least ^ 
at time Ti, either at least one of the two nodes have become minions, or the absolute value of the difference 
of the two nodes’ values is at least 4(^ log n. 


Proof We will assume that no absolute value reaches m at any point until time Ti and that the two nodes 
are still contenders at Ti. We should now prove that the difference of values is large enough. 

Consider the 32^^n log^ n rounds following time T. If a round involves an interaction with exactly one 
of the two fixed nodes we call if a spreading round. For each round, probabilify fhaf if will be spreading is 
which for n > 2 is af leasf 2/n. So, we can describe fhe number of spreading rounds among fhe 

2,2ffn log^ n rounds by considering a random variable X Bin(32^^n log^ n, 2/n). Then, by Chernoff 
Bound, fhe probabilify of having af mosf 32^^ log^ n spreading rounds is af mosf 


Pr \X < 32^^ log^ n] = Pr 


X < 64,^^ log^ n 



< exp 


Uff\og^n\ ^ ^ 1 

22-2 )- ^ 


Lef us from now on focus on fhe high probabilify evenf fhaf fhere are af leasf 32^^ log^ n spreading rounds 
befween fimes T and Ti, and prove fhaf fhe desired difference will be large enough wifh probabilify This 
implies fhe claim by Union Bound wifh fhe above even! (note fhaf for n > 2, ^ ^ holds). 

We assumed fhaf bofh nodes remain confenders during fhe whole fime, hence in each spreading round, 
a value of exacfly one of fhem, wifh probabilify 1/2 each, increases by one. Wifhouf fhe loss of generalify 
assume fhaf af fime T, fhe value of fhe firsl node was larger fhan or equal fo fhe value of fhe second node. Lef 
us now focus on fhe sum 5 of fc independenf uniformly disfribufed ±1 Bernoulli frials Xi where 1 < f < fc, 
where each frial corresponds fo a spreading round and oufcome +1 means fhaf fhe value of fhe firsf node 
increased, while —1 means fhaf fhe value of fhe second node increased. In fhis ferminology, we are done if 
we show fhaf Pr [S' > 4^ log re] > ^ for A; > 32^^ log^ re frials. 

However, we have fhaf: 


Pr[5 > 4^ log re] > 
> 


Pr[|S'| > 4^ log re] Pr[|5^| > 16^^ log^ re] 
2 “ 2 
Pr[\S^\ > k/2] _ Pr[|52| > E[S^]/2] 

2 “ 2 


(4.1) 

(4.2) 


where 4.1 follows from fhe symmefry 
Pr[S' < —4^1ogre]. For 4.2 we have 


> 


1 E[52l2 


> 


1 


(4.3) 


22.2 E[54] 

of fhe sum wifh regards fo fhe sign, fhaf is, from Pr[S' > 4^ log re] = 
used fhaf k > 32^^ log^ re and E[S'^] = k (more abouf Ibis below). 
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Finally, to get 4.3 we use Paley-Zygmund inequality and the fact that E[S'^] = 3k{k — 1) + k < 3/c^. Eval¬ 
uating E[5^] and E[S'^] is simple by using definition of S and the linearity of expectation. The expectation 
of each term then is either 0 or 1 and it suffices fo count the number of terms with expectation 1 , which are 
exactly the terms where each multiplier is raised to an even power. □ 

Now we are ready to prove the convergence speed with high probability 

Theorem 4.4. There exists a constant a, such that for any constant /3 > 3 following holds: If we set 
m = a/31og^n = 0(log^n), the algorithm elects a leader (i.e. reaches a configuration with a single 
contender) in at most 0{n log^ n) rounds (i.e. parallel time 0(log^ n)) with probability at least 1 — 1 /n^. 


Proof. Let us fix ^ > 


I large enough such that 
1 

^ ' 


1 


1 


>P- 


(4.4) 


for a constant 0 < p < 1. Let /3 be any constant > 3 and take a = y • (33^^). We set m = a(5 log'^ n 

and consider the first communication rounds of the algorithm execution. Lor any fixed node, the 

probability that it interacts in each round is 2/n. Let us describe the number of times a given node interacts 


within the first 


a(3n log'^ n 


rounds by considering a random variable B ~ Bin(-j 

Bound, the probability being selected at least m times in these rounds is at most: 

aj5 log^ n 


2/n). By Chernoff 


Pr \B >m]= Pr 


B > 


(1 + 1 ) 


< exp (^-^log^n) < 2 f^ 

Taking Union Bound over all n nodes, with probability at least 1 — , all nodes interact strictly less than 

m times during the first ^ rounds. 

Let us from now on focus on the high probability event in above, which means that all absolute values 
are strictly less than m during the first ^ = ^(33^^)nlog^ n rounds. Lor a fixed pair of nodes, this 

allows us to apply Lemma 4.3 followed by Lemma 4.2 (with parameter 0 p( 32 ^ 2 ^n\o^n+^n log n) — 

times. Each time, first by Lemma 4.3, after 320n log^ n rounds with probability at least + — ^ the nodes 
end up with values at least 4^ log n apart, in which case, after the next ^n log n rounds, by Lemma 4.2, one 
of the nodes becomes a minion with probability at least 1/24. Since Lemma 4.2 is independent from the 
interactions that precede it, by (4.4), each of the times if both nodes are contenders, we get probability 

at least p that one of the nodes becomes a minion. Consider a random variable W Bin(M|gE,^).The 
probability that both nodes in a given pair are still contenders after the first rounds is thus at most 

(1 — p) p < By Union Bound over all < 7^2 probability at least 

1 — for every pair of nodes, one of them is a minion after communication rounds. Hence, 

with this probability, there will be only one contender. 

Linally, combining with the conditioned event that none of the nodes interact m or more times gives 
that after the first = 0{n log^ n) rounds there must be a single contender with probability at least 

1 “ ^ ~ + 1 “ for + 3. A single contender means that leader is elected by Lemma 4.1. □ 


Linally, we can prove the expected convergence bound 

Theorem 4.5. There is a setting of parameter m of the algorithm such that m = 0(log^ n), such that the 
algorithm elects the leader in expected 0{n log^ n) rounds of communication, i.e. parallel time 0 (log^ n). 























Figure 2: The performance of the LM protocol. Both axes are logarithmic. The dots represent the results of 
individual experiments (100 for each network size), while the solid line represents the mean value for each 
network size. 


Proof. Let us prove that from any configuration, the algorithm elects a leader in expected O(nlog^re) 
rounds. By Lemma 4.1, there is always a contender in the system and if there is only a single contender, 
then a leader is already elected. Now in a configuration with at least two contenders consider any two of 
them. If their values differ, then with probability at least I/tt? these two contenders will interact in the 
next round and the one with the lower value will become a minion (after which it may never be a contender 
again). If the values are the same, then with probability at least 1/n, one of these nodes will interact with 
one of the other nodes in the next round, leading to a configuration where the values of our two nodes differ^, 
from where in the next round, independently, with probability at least 1/n^ these nodes meet and one of 
them again becomes a minion. Hence, unless a leader is already elected, in any case, in every two rounds, 
with probability at least l/vf the number of contenders decreases by 1. 

Thus the expected number of rounds until the number of contenders decreases by 1 is at most 2n^. In 
any configuration there can be at most n contenders, thus the expected number of rounds until reaching a 
configuration with only a single contender is at most 2(n — l)v? < 2n^ from any configuration. 

Now using Theorem 4.4 with /3 = 4 we get that with probability at least 1 — 1/n^ the algorithm converges 
after 0(n log^ n) rounds. Otherwise, with probability at most 1/n^ it ends up in some configuration from 
where it takes at most 2n‘^ expected rounds to elect a leader. The total expected number of rounds is therefore 
also 0(n log^ n) + 0(1) = 0(n log^ n), i.e. parallel time 0(log^ n). □ 

5 Experiments and Discussion 

Empirical Data: We have also measured the convergence time of our protocol for different network sizes. 
(Figure 2 presents the results in the form of a log-log plot.) The protocol converges to a single leader quite 
fast, e.g., in less than 100 units of parallel time for a network of size 10®. This suggests that the constants 
hidden in the asymptotic analysis are small. The shape of the curve confirms the poly-logarithmic behavior 
of the protocol. 

Discussion: We have given the first population protocol to solve leader election in poly-logarithmic time, 
using a poly-logarithmic number of states per node. Together with the results of [AAE06], the existence of 
our protocol implies that population protocols can compute any semi-linear predicate on their input in time 
0{n log® n), with high probability, as long as memory per node is poly-logarithmic. 


^This is always true, even when the new value is not larger, for instance when the values were equal to m -|- 1, the new value 
of one of the nodes will he m f m + 1. 
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Our result opens several avenues for future research. The first concerns lower bounds. We conjecture 
that the lower bound for leader election in population protocols is 0(log n), irrespective of the number of 
states used by the protocol. Further, empirical data suggests that the analysis of our algorithm can be further 
tightened, cutting off logarithmic factors. It would also be interesting to prove a tight a trade-off between 
the amount of memory available per node and the running time of the protocol. 
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